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ABSTRACT 


An  analysis  of  U.  S.  Navy  major  aircraft  accidents 
during  the  period  Fiscal  Year  1972  -  1974  was 
conducted.  Forward  (stepwise)  Multiple  Regression 
techniques  were  employed  on  a  group  of  ten  basic 
variables  considered  time  dependent.  The  multiple 
regression  techniques  were  employed  to  develop 
predictive  equations  for  the  dependent  .  variable, 
Accident  Rate  with  a  view  to  determining  which  of  the 
basic  variable  measures  were  significant  in  accident 
rate  studies  and  if  the  variables  are  unique  to  a 
specific  aircraft  community  or  generally  applicable  to 
all  aircraft. 

Aircraft  considered  independently  were  A-4,  A-6, 
A-7,  and  F-4,  additionally  composites  of  Attack 
aircraft  (A-3,  A-4,  A-5,  A-6,  A-7),  Fighter  aircraft 
(F-4  and  F-8) ,  Propeller  aircraft  (E-1,  E-2,  C-1,  C-2, 
S-2,  P-3,  C-1  17,  C-118,  and  C-130)  and  Helicopters 
(H-1,  H-2,  H-3,  H-46,  and  H-53)  were  considered. 
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I.   INTRODUCTION 


The  increased  sophistication  of  military  aircraft  and 
the  related  increased  dollar  costs  of  procuring  both 
aircraft  and  pilots  places  ever  greater  emphasis  on  the 
importance  of  determining  a  feasible  method  of  reducing 
losses  through  aircraft  accidents.  Many  research  efforts  to 
date  have  dealt  with  determining  the  causal  factors 
underlying  an  aircraft  accident,  then  using  these  causal 
factors,  attempting  to  develop  predictive  models  for 
accident  occurence. 

Aircraft  accidents  have  been  broadly  categorized  in 
terms  of  aircraft  design  malfunctions,  aircraft  equipment 
failures,  pilot  or  flight  personnel  error,  and  weather  as 
primary  causes  for  occurrence  of  mahor  aircraft  accidents. 
An  accident  is  designated  as  a  major  accident  if:  1)  loss 
of  life  is  involved;  2)  complete  loss  of  an  aircraft  is 
involved;  or  3)  substantial  damage  occurs  to  any  aircraft 
involved.  Substantial  damage  is  defined  in  appendix  A  of 
OPNAVINST  3750.6  (series) . 

The  lost  common  cause  of  aircraft  accident  cited  has 
been  pilot  error.  Brictson,  et.  al.  (1969)  studied  a  four 
year  span  of  aircraft  carrier  landing  accidents  involving 
attack  and  fighter  aircraft.  Approximately  seventy-eight 
percent  of  the  accidents  studied  had  pilot  error  as  the 
primary  causal  factor.  Brictson  noted  that  the  majority  of 
the  accidents  were  of  two  types,  hard  landings  and 
undershooting  the  landing  area.  The  small  deck  carriers 
accounted  for  seventy  percent  of  the  total  accidents  even 
though  the  large  deck  carriers  had  more  activity. 


Studies  conducted  for  the  Royal  Air  Force  by  Goorney 
(1965)  dealt  with  the  human  factors  involved  in  pilot  error. 
He  determined  that  pilot  fatigue,  emotional  stress, 
complacency,  lack  of  current  flying  experience  contributed 
to  pilot  error  and,  if  monitored,  could  lead  to  prediction 
of  the  likelihood  of  pilot  error  related  accidents. 

There  are  some  analysts  who  feel  that  if  pilot  error  is 
a  primary  cause  of  accidents  then  the  more  proficient  pilot 
should  make  fewer  errors.  This  belief  leads  to  the 
hypothesis  that  measures  of  pilot  proficiency  could  be  used 
as  predictive  measures.  Keller  (1961)  hypothesized  that 
flight  time  was  positively  correlated  with  pilot 
proficiency.  He  stated  that  were  a  pilot  to  fly  the  proper 
amount  he  would  attain  a  safe  proficient  ability  as  a  pilot. 
The  procedure  of  how  to  determine  the  proper  amount  of 
flight  time  necessary  to  attain  proficiency  and  how  the 
number  of  hours  needed  would  interact  with  fatigue  and 
complacency  were  not  fully  explored. 

Collicot,  et.  al.  (1972)  compared  accident  rates  of 
single-seat  aircraft  with  those  of  dual  piloted  aircraft. 
They  noted  that  if  the  operations  were  about  equal  the  dual 
piloted  aircraft  had  fewer  accidents  per  ten  thousand  flight 
hours  than  the  single  piloted  versions.  Though  the  authors 
refer  throughout  their  study  to  pilot  proficiency  they  also 
allude  to  a  possibility  of  temporary  mental  overload  as  a 
critical  factor  underlying  pilot  error. 

The  determination  of  pilot  error  tends  therefore  to 
expand  to  include  emphasis  on  temporary  mental  overload  as 
well  as  pilot  proficiency  measures.  Efforts  by  Kowalsky, 
et.  al.  (1974)  were  made  to  examine  causal  factors  in  high 
pilot  error  rates.  Previous  efforts  to  reduce  pilot  error 
had  concentrated  on  improving  pilot  proficiency.  Kowalsky 
and   his  co-researchers   used   cluster  analysis  and  pattern 


recognition  techniques  and  discovered  that  the  single  most 
important  causal  factor  was  that,  for  non-training, 
non-midair  accidents,  pilots  were  often  temporarily 
overloaded  and  incorrectly  evaluated  information  presented 
during  the  period  of  overload. 

Many  studies  and  much  effort  has  been  expended  in 
accident  research.  The  extensive  data  base  maintained  by 
the  Naval  Safety  Center  of  accident  related  information 
opens  doors  for  further  statistical  analysis  of  accident 
rates  with  goals  of  constructing  useful  predictive 
mathematical  models. 

Myers  (1974)  hypothesized  that  measures  of  pilot 
proficiency  and  experience  available  in  data  collection 
banks  would  be  sufficient  to  construct  a  predictive  model. 
He  used  statistical  techniques  of  principle  component 
analysis  applied  to  two  groups  of  fifty  pilots.  One  group, 
pilots  who  had  been  involved  in  aircraft  accidents,  the 
second,  pilots  with  no  accidents.  The  results  were  not  as 
good  as  was  desired,  possibly  due  to  the  limited  sample 
sizes  employed. 

A  second  approach  was  used  by  Stucki  and  Maxwell  (1975) 
who  used  the  techniques  of  regression  analysis  applied  to 
data  on  ever  two  thousand  aircraft  accidents  as  collected  by 
the  Naval  Safety  Center.  Their  efforts  dealt  with  pilot 
proficiency  variables,  aircraft  variables  and  type  of  flight 
information.  They  then  applied  regression  analysis  to  the 
composite  group  of  all  accident  involved  aircraft  in  the 
Navy*s  inventory.  This  effort  yielded  a  predictive  equation 
composed  of  four  pilot  related  variables  to  predict 
variations  in  aircraft  accident  rates. 

Work  by  Eobino  (1974)  reported  fluctuation  in  aircraft 
by   months   with   the   month   of  March  significantly  higher. 


Subsequent  efforts  in  this  area  by  Poodk  (1976)  failed  to 
support  the  March  phenomena  and  go  on  to  demonstrate  that 
fluctuations  in  aircraft  accident  rates  by  month  is  purely 
random. 

The  author  of  this  study  believes  that  the  premise 
promoted  by  flyer,  Stucki  and  Maxwell  and  others  is  valid. 
There  should  be  sufficient  data  available  on  current 
aircraft  accidents  to  conduct  detailed  statistical  analysis 
with  the  resultant  predictive  equations  both  meaningful  and 
useful.  The  variable  nature  of  aircraft  accident  rates 
suggest  that  the  underlying  factors  may  be  definable  and,  if 
they  can  be  determined,  used  in  accident  prevention. 

If  statistical  analysis  of  aircraft  accident  rates  can 
provide  information  on  accident  related  variables,  be  they 
pilot-oriented,  aircraft  oriented  or  related  to  some  other 
source,  which  vary  either  directly  or  inversely  with 
aircraft  accident  rates  then  preventative  actions  can  be 
taken  to  suppress  the  enormous  costs  in  dollars  and  human 
life  associated  with  aircraft  accidents. 


II.   NATURE  OF  THE  PROBLEM 


Monthly  accident  rates  exhibit  a  marked  variability  when 
each  calendar  month  is  compared  to  other  months.  The  belief 
that  some  months  are  consistently  higher  that  others  has 
been  noted  frequently  in  studies.  This  phenomena  has  been 
noted  in  studies  of  U.  S.  Air  Force  accident  rates  by  Zeller 
and  March  (1973)  and  by  Robino  (1972)  in  a  study  of  Navy 
aircraft  accident  rates.  Recent  work  by  Poock  (1976)  at  the 
0.  S.  Naval  Postgraduate  School  displays  no  statistical 
basis  for  any  month  being  consistently  high  and  attributes 
the  fluctuations  to  random  effects  of  the  underlying  causal 
factors. 

The  accident  rate  is  defined  as  the  total  number  of 
accidents  in  a  given  month  times  ten  thousand  hours  divided 
by  the  total  number  of  flight  hours  flown  that  month. 

The  efforts  of  this  study,  motivated  by  work  of  Stucki 
and  Maxwell  (1975) ,  were  to  explore  accident  rate  dependence 
on  time  related  variables  by  specific  aircraft  types  where 
possible  and  composites  of  aircraft  types  where  necessary. 
The  results  desired  are  a  series  of  predictive  equations 
unique  to  a  specific  aircraft  or  a  community.  It  is 
believed  that  if  in  fact  aircraft  type  has  no  large  effect 
on  accident  rates  that  the  data  will  yield  similar  equations 
for  each  type  considered. 
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III.   ANALYTICAL  PROCEDURES 


This  chapter  contains  the  data  selection  procedure,  the 
techniques  employed  in  data  preparation,  a  description  of 
the  analysis  procedures  and  a  summary  of  decision  criterion 
employed  in  selecting  the  best  equation  for  predicting  the 
variance  in  the  dependent  variable  rate. 


A.   DATA  SOURCE 


All  Navy  and  Marine  aircraft  accidents  and  incidents  are 
reported  in  detail  to  the  Naval  Safety  Center,  NAS,  Norfolk, 
Va.  The  reporting  criteria  is  detailed  in  Navy  Aircraft 
Accident,  Incident  and  Ground  Reporting  Procedures 
(0PNA7INST  3760.6  (series) )  .  As  Naval  Safety  Center  is  a 
repository  for  all  data  recorded  on  aircraft  accidents  they 
are  the  source  of  data  used  in  this  report. 


B.   DATA  SELECTION 


As  the  goal  of  this  study  is  to  apply  the  concept 
envisioned  by  Stucki  and  Maxwell  (1975)  to  individual  type 
aircraft  where  possible  and  to  group  type  of  ai  rcraft  where 
necessary,  the  same  basic  data  set  as  provided  Stucxi  and 
Maxwell  by  the  Naval  Safety  Center  was  employed. 

Table  1  lists  the  data  initially  requested  from  and 
provided  by  Naval  Safety  Center. 
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TABLE  1 


DATA  SET  REQUESTED  FROM  NAVAL  SAFETY  CENTER 


Data  concerning  the  pilot: 

1.  Age 

2.  Injuries 

3.  Number  of  previous  service  tours 

4.  Total  flying  time  in  aircraft  model  in  which 
accident  occurred 

5.  Total  flight  hours  in  previous  ninety  days 

6.  Total  nighttime  flight  hcurs  in  previous  ninety  days 

7.  Total  daylight  carrier  landings  in  previous  thirty 
days 

8.  Total  night  carrier  landings  in  previous  thirty  days 

9.  Number  of  years  as  designated  Naval  Aviator 

Data  concerning  aircraft: 

1.  Model 

2.  Damage 

3.  Number  of  tours  between  major  aircraft  rework 

4.  Type  of  last  major  inspection 

5.  Hours  since  last  inspection 

6.  Identification  of  the  system  or  component  failure 

Data  concerning  the  flight: 

1 .  Major  command 

2.  Reporting  custodian 

3.  Ships1 s  hull  number  (if  applicable) 

4.  Marine  Air  Wing  (if  applicable) 

5.  Location 

6.  Flight  Purpose  Code 

7.  Type  of  operation  code 

8.  Phase  of  operation  in  which  the  accident  occurred 

Data  concerning  the  accident: 

1.  Accident  identification  number  including  calendar 
date 

2.  Other  aircraft  damaged 

3.  Other  personnel  injured 

4.  Contributing  causal  factors 

5.  Special  data  not  otherwise  listed 

6.  Heather 

7.  Accident  rate   for   the    month   in    which   the   accident 
occurred 


From      the      available      data      set   ten   basic    variables    were 
selected    in   cooperation    with    Naval  Safety      Center      personnel 
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for  inclusion  in  this  study  (see  Table  2) . 

In  general,  multiple  regression  requires  that  variables 
are  measured  on  interval  or  ratio  scale  and  that  the 
relationships  among  the  variables  are  linear  and  additative. 

The  data  set  provided  consisted  of  information  on 
Two-Thousand-One-Hundred-Ten  accidents  or  incidents  which 
occurred  during  Fiscal  lears  1969  to  1974,  inclusive. 
Selection  of  a  suitable  time  span  was  based  upon  the 
following  basic  constraint  considerations: 

(1)  The  necessity  to  have  as  large  a  sample  size  as 
possible  to  enhance  validity  of  statistical  inferences. 

(2)  The  desire  to  restrict  the  years  in  the  sample  to 
periods  when  the  aircraft  inventory  was  reasonably 
consistent. 

(3)  The  necessity  to  consider  gaps  and  inconsistencies 
in  desired  data  points  due  to  changes  and/or  modifications 
in  data  collection  requirements  and  recording  procedures 
that  had  occurred  within  the  Accident  Reporting  System. 

To  accomplish  these  major  considerations  the  entire  data 
base  was  initially  included.  Then  each  variable  was 
examined  throughout  the  entire  data  set  in  terms  of  how 
changes  in  the  reporting  procedure  or  inconsistencies  in  the 
data  would  effect  that  variable.  This  treatment  resulted  in 
the  reduction  of  the  data  base  to  the 
Five-Hundred-Sixty-Nine  accidents  occurring  during  the  three 
year  period  FY  72-74  inclusive. 

During  this  period  the  inventory  of  aircraft  with  which 
this  study  deals  was  reasonably  constant. 

While  this  treatment  did  create  a  complete  data  base 
wherein  all  information  desired  was  available  for  all 
accidents  the  resulting  size  does  hamper  the  investigation 
of  aircraft  types  in  cases  where  the  inventory  is  small  to 
begin  with  and/or  where  there  are  few  accidents  as  in  the 
A-3  community.  This  leads  in  some  instances  to  grouping 
data  under  categories  such  as  Propeller,  Attack  or  Helos. 
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C.   VARIABLE  SELECTION 

The  ten  fcasic  variables  selected  for  inclusion  are  shown 
in  Table  2. 


TABLE  2 


DATA  SET  INCLUDED  IN  CURRENT  STUDY 


1.  Accident  rate  by  month  (RATE) 

2.  Pilots  age  (AGE) 

3.  Total  flight  time  in  accident  involved  aircraft 
model  (TTIME) 

4.  Total  flight  time  during  ninety  days  preceding 
accident  (TOT90) 

5.  Total  night  flight  time  during  the  preceeding  ninety 
nights  (NITE90) 

6.  Daylight  carrier  landings  during  the  preceeding 
thirty  days  (CLDAY) 

7.  Night  carrier  landings  during  the  preceeding  thirty 
nights  (CLNITE) 

8.  Number  of  aircraft  tours  (ACTOUR) 

9.  Aircraft  flight  hours  since  last  major  or  calender 
inspection  (ACHRS) 

10.  Number  of  years  designated  Naval  Aviator  (DNA) 


In  addition  to  the  basic  variables  the  author  used  an 
eleventh  variable,  DAY90  =  TOT90  -  NITE90,  which  is  the 
total  daylight  flight  time  in  the  preceeding  ninety  days. 

Pilots  age  and  years  designated  Naval  Aviator  were 
included  as  they  are  variables  that  are  historically  used  as 
indicators  of  maturity  and  perhaps  proficiency.  If,  as  the 
author  believes,  the  hypothesis  that  the  older  pilots  tend 
to  be  safer  pilots  through  a  finer  sense  of  judgement  of 
risks  involved  is  a  valid  hypothesis,  the  author  would 
expect  a  negative  simple  correlation  between  AGE  and  DNA 
with  rate.  However  because  the  number  of  other  confounding 
factors  is  great  a  negative  correlation  would  not  justify 
the  acceptance  of  the  hypothesis. 

The  variables  consisting  of  pilot  flight  hours  and 
carrier  landings  are  considered  to  be  measures  of  pilot 
currency   and   proficiency   by   many   in   the   Navy   and  are 
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therefore  included. 

Aircraft  tours  is  included  as  a  measure  of  the  general 
condition  of  the  aircraft  and  as  an  indication  of  aircraft 
age.  Each  aircraft  in  the  Navy's  inventory  undergoes  a 
Periodic  Aircraft  Rework  (PAR)  for  analysis,  repair  and 
conversion  at  intervals  unique  to  the  model  aircraft  after  a 
specific  number  of  flight  hours.  This  variable  also  serves 
to  monitor  any  reliability  anomalies  other  than 
"new-bet ter-than-used"  as  mentioned  by  Butterworth,  et.al. 
(1974) . 

Aircraft  hours  is  included  as  a  measure  of  aircraft 
condition  and  usage  since  major  inspection,  primarily  the 
calendar  inspections. 


D.   DATA  PREPARATION 

The  basic  assumptions  for  multiple  regression  analysis 
require  that  data  be  measured  in  at  least  interval  or  ratio 
scale  and  that  the  relationship  among  the  variables  be 
linear  and  additive. 

All  data  points  used  were  ajudged  to  be  measured  on  an 
interval  scale.  Raw  data  for  each  type  or  group  of  aircraft 
was  averaged  by  months  for  each  of  the  thirty-six  months 
included  in  the  data  set  where  there  was  an  accident  for 
that  type  aircraft. 


E.   THE  ANALYSIS  TECHNIQUE 

The  analysis  procedure  employed  was  Multiple  Regression 
using  Forward  (stepwise)  Inclusion.  The  Statistical  Package 
for  the  Social  Sciences (SPSS)  compiled  and  edited  by  Nie, 
et.al.  (1955)  includes  a  forward  stepwise  multiple  regression 
computer  program  package  developed  by  Jae-On  Kim  and  Frank 
J.  Kohout  at  the  University  of  Iowa. 
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This  package  was  selected  as  the  means  of  conducting  the 
statistical  analysis  of  the  data  sets. 

Kim  and  Kohout  state  that  forward  stepwise  multiple 
regression  is  a  recognized  technique:  "(1)  "to  find  the  best 
linear  prediction  equation  and  evaluate  its  prediction 
accuracy;  (2)  to  control  for  other  confounding  factors  in 
order  to  evaluate  the  contribution  of  a  specific  variable  or 
set  of  variables;  and  (3)  to  find  structural  relations  and 
provide  explanations  for  seemingly  complex  multivariate 
relationships,  such  as  is  done  in  path  analysis." 

The  computer  program  provides  the  user  with  various 
options  for  treatment  of  data  sets,  calculation  of 
statistics  and  output  formats.  The  procedure,  Listwise 
Deletion  of  Missing  Data,  is  the  default  option  and  the  most 
conservative  in  that  it  maintains  sample  size  and  is  the 
most  accurate.  Since  the  data  base  finally  arrived  at  was 
complete,  no  data  was  deleted  by  the  computer  program 
option.  If  the  data  base  were  missing  quite  a  few 
individual  data  points  this  procedure  could  result  in  a 
drastic  decrease  in  the  sample  size.  This  fact  was  one  of 
the  underlying  considerations  in  the  data  base  selection 
criteria. 

The   options   not   selected   could,   if   not   used   with 

considerable   prudence   and   judgement,    result   in  the 

introduction  of  large  amounts  of  bias   that   would   be   very 

difficult   to  detect   without  a  very  good  feel  for  expected 

experimental  results.    It   is   for   this   reason   that  the 

Listwise  Deletion   of  Missing  Data  option  was  used  for  all 

computer  runs  in  the  current  study.   Kim   and  Kohout  state 

that: 

"There  are  many  occasions  for  which  simple  linear 
models  are  inadequate.  It  may  be  that  (1)  the  bivariate 
relationship  is  expected  (on  the  basis  of  theory)  to  take 
a  specific  nonlinear  form,  (2)  the  bivariate  relationship 
is  simply  unknown,  and  the  examination  of  the  scatterplots 
suggests  clear  deviation  from  linearity  or,  at  least,  the 
need  for  testing  the  adequacy  of  the  linearity  assumption, 
or  (3)  the  combined  effects  of  the  independent  variables 
are  not  additive.  Some  of  the  ways  to  handle  these  types 
of  nonlinear  situations  are  (1)  to  transform  the  original 
variables  in  such  a  way  that  the  resultant  relationships 
among  the  transformed  variables  become  linear,  (2)  to  find 
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a  simple  nonlinear  form  through  the  use  of  polynomial 
regression,  and  (3)  to  introduce  interaction  terms  as  new 
variables. " 


There  are  two  extreme  viewpoints  in  regression 
analysis,  with  valid  arguments  to  support  both  cases. 
Draper  and  Smith  (1966)  explain  that  the  two  opposing 
viewpoints  are  "(1)  to  make  the  prediction  equation  valid 
you  should  include  as  many  predictor  variables  as  possible; 
and  (2)  because  of  increased  cost  of  obtaining  variables  and 
monitoring  them,  the  equations  should  include  as  few 
variables  as  possible." 

The  process  of  selecting  the  best  regression  equation 
is  the  process  of  compromising  between  these  two  extreme 
viewpoints.  There  is  no  unique  statistical  procedure  for 
choosing  the  'best'  equation  and  large  amounts  of  personal 
judgement  are  required.  In  this  regard  the  techniques  of 
regression  analysis  become  an  art  as  well  as  a  science. 

Initial  analysis  of  data  from  the  current  study 
displayed  indications  of  nonlinearity  and  interactive 
effects  between  independent  variables. 

To  deal  with  these  effects  no  single  set  of  regression 
variables  were  deemed  'best'  but  rather  a  series  of  seven 
different  regression  variable  packages  were  constructed. 
Each  data  set  was  run  with  all  seven  different  packages. 

Variables  eight  and  nine  from  Table  2  were  deemed 
aircraft  oriented  variables  while  the  remaining  basic 
variables  were  considered  human  oriented  measures. 

Regression  I  consisted  of  only  those  basic  variables 
that  ware  human  related.  Regression  II  contained  all  the 
basic  variables  pilot -or  aircraft  oriented.  Regression  III 
used  the  pilot  oriented  basic  variables  plus  transformations 
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consisting  of  the  square  and  the  square  root  of  each  basic 
variable  used.  Regression  IV  includes  all  of  Regression  III 
plus  the  square  of,  the  square  root  of  and  the  two  aircraft 
related  basic  variables.  Regression  V  contains  all  of 
Regression  III  plus  the  twenty-eight  possible  cross-products 
of  the  eight  basic  variables.  Regression  VI  contains  all  of 
Regression  V  plus  the  aircraft  related  basic  variables, 
their  squares  and  square  roots  and  the  cross  product  of 
Aircraft  Tours  and  Aircraft  Hours.  Regression  VII  contains 
the  ten  basic  independent  variables,  their  squares,  square 
roots  and  the  forty-five  possible  cross-products. 

The  decision  to  employ  squares  and  square  roots  was 
made  to  provide  a  larger  number  of  variables  capable  of 
accounting  for  curvilinearty.  The  introduction  of 
cross-products  allows  for  interactive  effects  of  independent 
variables.  Ihe  use  of  the  Forward  (stepwise)  Inclusion 
Multiple  Regression  computer  program  facilitates  the 
creation  and  inclusion  of  many  various  transforms.  The 
packages  used  in  the  study  were  considered  the  most 
versatile  of  the  trial  packages  used  in  preliminary  studies 
by  the  author. 


F.   DECISION  CRITERIA 


The  forward  stepwise  multiple  regression  program 
contains  preselectable  stopping  criteria  that  were  adjusted 
to  facilitate  introduction  of  variables  into  the  equation 
that  by  themselves  made  a  significant  contribution  to 
explaining  the  variance  in  the  dependent  variable  RATE.  As 
a  • rule-of-thumb'  in  predictive  equation  selection  the  study 
attempts  to  restrict  the  number  of  variables  in  each 
equation  to  five.  This  decision  is  based  upon  the  degrees 
of  freedom  in  the   regression   equation   and  the   need   to 
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maintain   a  significant  ratio  to  provide  a  solid  statistical 
base  for  conclusions. 

With  the  degree  of  freedom  restrictions  attained  the 
primary  decision  criteria  are: 

1)  The  equation  with  a  significance  level  of 
100  (1-alfa) percent  greater  than  or  equal  to  ninty-five 
percent,  and 

2)  That  equation  that  accounted  for  the  largest  amount 
of  variance  in  the  dependent  variable. 

In  those  cases  where  the  choice  of  the  •best'  equation 
was  not  clearly  indicated  other  more  subjective  measures 
were  employed,  such  as,  examination  of  scatterplots  of  the 
standardized  residual  versus  the  standardized  predicted 
dependent  variable,  the  plot  of  the  standardized  residuals, 
and  consideration  of  the  intuitive  impact  of  the  particular 
variables  in  the  eguations  under  consideration. 

For   example,    all   other   decision   criteria   being 
statistically  equal  the  equation  containing  CLDAY  -    (CLDAI) 2 
+   (RTCLDAY)   would  be  selected  over  the  equation  containing 
(AGE)  (ACTO0BS)  +  (TTIME)  (ACHRS)  . 

Since  many  of  the  regression  packages  were  very  similar 
some  cases  cculd  yield  the  same  eguations  for  more  than  one 
regression  package,  while  other  cases  could  yield  no 
significant  equation  for  any  regression  package.  In  either 
case  for  completeness  the  •best'  eguations  is  indicated  in 
the  results  even  if  that  equation  is  not  statistically 
significant. 

This  chapter  has  described  the  analysis  procedures  used 
in   the   development  of  predictive  equations  for  variance  in 
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accident  rates  by  month.  The  next  chapter  contains  the 
results  of  the  analysis  by  aircraft  community  and  aircraft 
type  where  possible. 
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17.   RESULTS 


The  results  by  aircraft  type  or  aircraft  community  are 
contained  in  this  chapter.  The  best  predictive  equation 
provided  by  the  seven  Regression  Packages  are  shown. 


A.   ATTACK  AIRCRAFT 


The  aircraft  of  the  attack  community  were  divided  into  a 

composite  regression  and   three  separate,  regressions.  The 

composite  consisted  of  accident   involved   aircraft   of  the 

types   A-3,   A-4,   A-5,   A-6,  and  A-7.   All  variants  of  each 

type  aircraft  were  included  (for   example,   KA-3,   EA-3  and 

A-3) .   The  three  separate  regressions  were  conducted  on  A-4, 
A-6,  and  A-7  respectively. 

1.   Attack  Composite 

This  category  was  used  to  investigate  trends  unique 
to  the  Attack  community  and  not  peculiar  to  a  specific 
attack  type  aircraft.  Additionally,  the  relatively  small 
size  of  the  A-3  and  A-5  communities  precluded  an  independent 
analysis  of  these  aircraft.  Rather  than  omit  A-3*s  and 
A-5's  they  were  included  here.  The  accident  involved 
aircraft  in  this  category  provided  the  maximum  sample  size 
of  thirty-six  data  points  for  analysis.  As  the  number  of 
aircraft  included  of  each  type  are  not  equal  the  category 
may  be  biased  towards  the  larger  A-4  and  A-7  communities, 
however,  the  author  felt  that  this  would  in  no  way   endanger 
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the  results  as  the  Attack  community  is  being  considered  as  a 
community.  Hhile  data  was  available  for  A-1»s  they  were  not 
included  for  two  reasons,  firstly,  it  was  desired  to  limit 
the  category  to  jet  aircraft  and  secondly,  the  A-1  has  been 
phased  out  of  the  active  aircraft  inventory. 

The  basic  variables  in  the  regression  accounted  for 
less  than  twenty  percent  of  the  variance  in  accidents  rate, 
however,  the  regressions  consisting  of  transformed  variables 
yielded  two  equations  of  approximately  equal  quality  which 
are: 

A.  Rate(ATTACK)  =  0.98593  -  0.01221  (ACTO0"H)2  + 
0.00232  (ACHES)  -  0.00226  (CLDAI) 2  +  0.62245  (RTCLNITE)  - 
0.00193  (NITE90)2 


and. 


B.  Rate(ATTACK)  =  0.79467  -  0.00268  (CLDAY) 2  + 
0.68955  (RTCLNITE)  -  0.00664  (NITE90) 2  +  0.15186  (NITE90)  - 
0.09602  (BTTOT90) . 

Equation  A  and  equation  B  are  both  significant  at 
the  99  percent  level  and  equation  B  accounts  for  59.74 
percent  of  variance  in  rate  contrasted  to  51.11  percent  for 
equation  A.  The  author*s  decision  criteria  were  met  by  both 
equations,  however,  examination  of  residual  plots  favored 
equation  A  by  a  narrow  margin.  The  Forward  (stepwise) 
Regression  criteria  selects  variables  for  inclusion  in  the 
predictive  eguation  by  adding  the  variable  that  accounts  for 
the  largest  increase  in  the  percent  of  variance  in  the 
dependent  variable.  It  is  of  interest  to  note  that  the 
aircraft  oriented  variables  entered  the  predictive  equation 
first  in  Equation  A  followed  by  the  pilot  oriented 
variables.  With  the  deletion  of  aircraft  variables  the 
second   equation   provided   by  only  pilot  oriented  variables 
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accounted  for  an  additional  8.63  percent  of  variance  in  rate 
but  with  a  lesser  initial  effect  of  the  first  two  variables 
added. 

It  can  be  observed  that  equation  A  contains  two 
aircraft  related  variables   while   equation  B  is   composed 

entirely   of  pilot  oriented   variables   of  which  three  are 

included  in  both  equations.    The   predictive  equation  (B) 

contains  the  variable  NITE90  in  two  functional  forms.  The 

net  effect  on  the  dependent  variable  rate   is  positive  for 

values   of   NITE90   less  than  or  equal  to  22.87  hours.  For 

hours  greater  than  22.87  the  effect  is  to  reduce  the 
predicted  monthly  accident  rate. 

2.   A-  ;  Aircraft 

This  category  contains  all  accident  involved  A-4  and 
TA-4  aircraft  in  the  three  year  period  studied  and  provides 
a  sample  size  of  thirty-one  cases.  The  regression  of  the 
basic  variables  accounted  for  only  22  percent  of  variance  in 
rate  at  a  significance  level  of  75  percent.  The  predictive 
equation  considered  •best*  was: 

Hate  (A-4)  =  -0.00804  (DNA) 2  +  0.10473  (RTACHRS) 
-0.00010  (DAY90)  2  +  0.77584  (RTNITE90)  -  0.11160  (NITE90)  - 
0.13246. 

This  equation,  significant  at  the  95  percent  level, 
accounts  for  42.68  percent  of  the  variance  in  rate. 

It  is  noted  that  four  of  the  five  variables  are 
pilot  oriented  variables  three  of  which  are  based  on  hours 
flown.  The  predictive  equation  for  A-4  aircraft  contains 
MITE90  in  two  functional  forms  as  did  the  predictive 
equation  for  the  Attack  community.   Again  as  in  the   Attack 
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community  the  net  effect  of  NITE90  is  positive  for  the  lower 
number  of  hours  flown.  Particularly  the  net  effect  is 
positive  for  NITE90  less  than  or  equal  to  forty-eight  hours 
and  negative  for  values  greater  than  forty-eight. 

3.   A-6  Aircraft 

This  category  was  restricted  to  a  sample  size  of 
twenty  due  to  relatively  few  accidents  and  a  smaller 
community.  In  order  to  achieve  the  largest  sample  size 
possible  the  author  included  EA-6  aircraft  with  the  A-6  and 
KA-6  models.  The  small  sample  size  tends  to  make  suspect 
any  results  derived  by  regression  analysis. 

The  basic  variables  accounted  for  less  than  ten 
percent  of  the  variance  in  rate  while  the  'best'  predictive 
equation  accounted  for  40.27  percent.  This  equation  is 
however  significant  only  at  the  75  percent  level.  The 
author  feels  that  the  small  sample  size  tends  to  negate  any 
usefulness  of  this  regression.  The  equation  is  included  for 
continunity  of  the  study  and  for  discussion  purposes.  The 
equation  is: 

Rate  (A-6)  =  16.28967  -  0.04604  (DNA) 2  ♦  2.33592 
(RTDNA)  -  20.30561  (RTACTOOR)  +  5.34649  (ACTOOR)  ♦  0.05874 
(RTTIME) . 

It  is  noted  here  that  only  three  basic  variables  are 
used  in  seme  functional  form  with  a  balance  of  aircraft  and 
pilot  oriented  variables  used.  The  independent  variables 
DNA  and  ACTOOR  are  each  used  in  two  functional  forms.  The 
net  effect  of  ACTO0RS  is  positive  for  values  greater  than  or 
equal  to  15  tours  while  the  net  effect  of  DNA  on  the 
dependent  variable  rate  is  positive  for  values  less  than  or 
equal  to  13.70  and  becomes  negative  for  values  greater   than 
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13.7  years. 

4.   A^7  Aircraft 

This  category  provides  a  sample  size  of  thirty-two 
cases  based  on  all  A-7  aircraft  models  involved  in  accidents 
during  the  study  period.  The  regression  analysis  yielded 
the  following  predictive  equation: 

Bate  (A-7)  =  0.27170  (RTCLDAY)  -  0.01856  (CLNITE) 2  ♦ 
0.21346  (HTNITE90)  +  4.01164  (RTDNA)  -  0.88896  (DNA)  - 
3.70545. 

This  equation,  significant  at  the  99  percent  level 
accounts  for  55.01  percent  of  the  variance  in  rate. 

The  predictive  equation  for  A-7  aircraft  also 
contains  two  functional  forms  of  DNA.  Again  here  as  in  the 
A-6  the  net  effect  is  negative  for  large  values  of  DNAr 
greater  than  21  years  in  this  case.  While  the  effect  on 
rate  is  positive  for  values  less  than  21  years  the  net 
effect  decreases  as  the  value  of  DNA  approachs  21  years. 
This  agrees  with  the  intuitive  feeling  that  DNA  is  a  measure 
of  pilot  proficiency  and  the  more  experience  the  safer  the 
pilot. 
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TABLE  3 


ATTACK  AIBCfiAFT  VARIABLE  SUMMARY 


7ABIABLE 

ATI 

CACK-A 

ATI 

:ack-b 

A-4 

A- 6 

A- 7 

FBEQ 

NITE90 

0, 

.07382 

0. 

.12366 

2 

ACTOOB 

-0. 

.17445 

ACHBS 

0. 

,31152 

DNA 

0. 

.15431 

DAY902 

-0. 

.12592 

NITE902 

-0. 

04184 

-0. 

04184 

1/1 

CLDAY2 

-0. 

,31728 

-0. 

31728 

1/1 

CLNITE2 

-0. 

,23345 

ACTOUB2 

-0. 

37244 

DNA2 

-0. 

.31353 

-0 

.34703 

2 

RTTIME 

-0. 

.03596 

BTTOT90 

0. 

06426 

RTNITE90 

0. 

22177 

0. 

.22398 

2 

RTCLDAY 

0. 

.34684 

BTCLNITE 

0. 

13914 

0. 

13914 

1/1 

RTACTOUR 

-0. 

.14537 

BTACHBS 

0. 

30860 

BTDNA 

-0. 

24288 

0. 

22979 

2 

TABLE  ENTBIES  ARE  THE  CORRELATION  COEFFICIENT  OF  THE 
DISPLAYED  VARIABLE  WITH  RATE 


Table  3  displays  the  basic  and  transformed  variables 
as  used  in  the  regression  package.  The  suffix  '2*  indicates 
that  variable  squared  and  the  prefix  'RT'  the  square  root  of 
the  variable.  Tabled  are  the  correlation  coefficients  of 
the  displayed  variables  with  the  dependent  variable  rate. 

For  those  cases  where  the  variable  appears  in  more 
than  one  equation  the   correlation   coefficients   are   quite 
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consistant  with  the  exception  of  RTDNA  where  the  coefficient 
for  A^7  is  contrary  to  what  would  be  normally  considered 
true.  While  it  is  generally  believed,  the  remaining 
aircraft  types  bear  out,  that  the  adage  that  the  older,  more 
experienced  pilot  has  fewer  accidents,  this  does  not  appear 
to  hold  for  the  A-7  aircraft.  It  is  noted  that  the  values 
for  DMA  and  RTDNA  in  Table  3  for  A-7  are  positively 
correlated,  in  addition  the  coefficient  for  pilot  age  is 
also  positively  correlated  for  A-7.  The  author  is  unable 
from  his  experience  to  explain  this  unusual  occurance. 


TABLE  4 


ATTACK  AIRCRAFT  BASIC  VARIABLE  SUMMARY 


VARIABLE 

ATTACK- 

■A 

ATTACK- 

-B 

A-4 

A- 6 

AGE 

TTIME 

1 

TOT90 

1 

DAY90 

1 

NITE90 

1 

2 

2 

CLDAY 

1 

1 

CLNITE 

1 

1 

DNA 

1 

2 

ACTOOR 

1 

2 

ACHRS 

1 

1 

A-7 


FREQ 
0/0 
1/1 
0/1 

1/1 

4/5 
2/2 
2/2 

5/5 
3/2 
2/1 


Table  4  relates  the  usage  of  basic  variables  in  each 
category  in  some  functional  form.  Basic  variables  AGE, 
TTIME,  TOT90,  and  DAI90  are  used  one  time  or  less  indicating 
that  these  measures  have  little  or  no  effect  on  predicting 
variance  in  aircraft  accident  rates.  NITE90  and  DNA  are  the 
high  usage  variables  followed  by  CLDAY,  CLNITE,  ACTOUR  and 
ACHRS  with  moderate  usage. 
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B.   FIGHTEB  AIRCRAFT 

The  analysis  of  fighter  aircraft  was  restricted  to  F-4 
and  a  composite  of  F-4  and  F-8  aircraft.  The  data  base  did 
not  provide  enough  data  to  conduct  independent  analysis  of 
F-S's  by  themselves  which  led  to  the  composite  category. 

1 .   Fighter  Composite 

This  category  was  included  to  provide  a  method  of 
including  F-8  aircraft  and  to  facilitate  the  possible 
contrast  of  the  Attack  and  Fighter  communities. 

The  composite  analysis  yeilded  a  sample  size  of 
thirty-six  cases  primarily  on  the  strength  of  the  F-4 
community.  The  basic  variable  regression  accounted  for  less 
than  twenty-five  percent  of  the  variance  in  rate  at  a 
significance  level  of  95  percent.  Once  again  the  regression 
using  transformed  variables  provided  a  better  predictive 
eguation  as  shown  below. 

Rate  (Fighter)  =  1 .21906 (RTACTOUR)  ♦  0.23768 
(RTDAY90)  +  0.01897  (CLDAY) 2  ♦  2.38695  (RTCLDAY)  -0.92126 
(CLDAY)  -  2.48215. 

This  eguation  accounts  for  40.45  percent  of  the 
variance  in  rate  and  is  significant  at  the  99  percent  level. 

It  is  observed  that  the  variable  CLDAY  appears  in 
each  of  its  functional  forms  and  while  it  does  not  account 
for  the  most  variance  initially  in  conjunction  with  the 
forms  of  ACTO0R  and  DAY90  it  adds  about  sixteen  percent  to 
the  accounting  of  variance  in  rate.  The  net  effect  on  the 
dependent  variable  rate  of  the  variable  CLDAY  is  positive 
for  values  less  than  or  equal  to  11.56  daytime  carrier 
landings  in  thirty  days.   For  values  greater  than  11.56   the 
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net   effect   becomes  negative  and  will  tend  to  decrease  the 
accident  rate. 

2.  IzH   Aircraft 

The  category  of  F-4  aircraft  consisted  of  a  sample 
size  of  thirty-six  data  points.  Regression  analysis  yielded 
the  following  predictive  equation: 


Rate  (F-4)  =  0.19142  (RTDAY90)  -  0.03663  (CLNITE) 2  - 
0.000002  (TTIME)  2  +  0.17982  (RTTIME)  -0.01302  (DNA)  2  - 
1.59073. 

This  equation  accounts  for  34.79  percent  of  variance 
of  rate  at  significance  level  95  percent.  The  equation 
generated  by  the  basic  variables  alone  accounted  for  less 
than  nine  percent  and  were  not  significant  at  the  75  percent 
level. 

The  predictive  equation  deemed  'best'  was  generated 
from  Regression  III  and  contained  only  pilot-oriented 
variables. 

The  variable  TTIttE  appears  here  in  two  functional 
forms  with  a  positive  net  effect  on  rate  for  values  less 
than  or  equal  to  2006  hours. 
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TABLE  5 


FIGHTER  AIRCRAFT  VARIABLE  SUMMARY 


VARIABLE 

FIGHTER 

F-4 

FREQ 

CLDAY 

-0.16666 

- 

2 

TTIME2 

-0.11285 

1 

CLDAY2 

-0.26346 

1 

CLNITE2 

-0.19546 

1 

DNA2 

-0. 16277 

1 

RTTIME 

0.08688 

1 

RTDAY90 

0.24865 

0.23319 

2 

RTCLDAY 

0.00643 

1 

RTACTOOR 

0.44776 

• 

1 

TABLE  ENTRIES  ARE  THE  CORRELATION  COEFFICIENTS  OF  THE 
DISPLAYED  VARIABLE  WITH  RATE 


In  the  case  RTDAY90  which  appears  in  both  equations 
the  correlation  coefficients  are  quite  consistent.  It  is 
noted  that  the  variables  normally  considered  as  pilot 
proficiency  variables  are  negatively  correlated  with  rater 
except  in  the  cases  where  square  roots  are  used. 
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TABLE  6 


FIGHTER  AIBCRAFT  BASIC  VARIABLE  SUMMARY 


VARIABLE 

AGE 

TTIME 

TOT90 

DAY90 

NITE90 

CLDAY 

CLHITE 

DNA 

ACTOUR 

ACHRS 


PIGHTER 


F-4 


FREQ 
0 
2 
0 
2 
0 
3 
1 
1 
1 
0 


Table  6  relates  the  usage  of  basic  variables  in  some 
functional  form  by  category.  The  basic  variables  CLDAY, 
TTIME,  and  DAY90  are  the  high  usage  variables  in  this 
category  and  are  all  pilot  related  variables.  The  variables 
AGE,  TOT90,  NITE90  and  ACHRS  did  not  appear  in  any  form  in 
the  fighter  community 


C.   PROPELLER  AIRCRAFT 

The  aircraft  considered  in  the  propeller  aircraft 
category  consisted  of  E-1,  E-2r  C-1,  C-2,  S-2,  P-3,  C-117, 
C-118,  and  C-130.  Due  to  the  relatively  small  size  of  each 
individual  community  and  the  infrequency  of  accidents  it  was 
necessary  to  combine  all  aircraft  into  one  category  entitled 
•PROPS'.  This  procedure  is  somewhat  unnerving  as  there  are 
normally   aspirated  and  turboprop  aircraft  together  as  well 
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as  carrier-based  and  land-based  aircraft.  This  tends  to 
bias  the  results  and  applications  or  inferences  cannot  be 
directed  toward  any  particular  member  aircraft  in  the  group. 

The  result  of  aggregating  the  above  aircraft  is  a  sample 
size  of  twenty-six  cases  which  provides  an  equation  of  basic 
variables  that  accounts  for  less  than  thirty  percent  of  the 
variance  in  rate  at  a  significance  level  of  seventy-five 
percent. 

Begression  packages  V,  VI  and  VII  provided  the  same 
equation  which  was  judged  'best'  by  the  author. 

Rate  (PROPS)   =   0.35935   +    0.00002[  (TTIHE)  (NITE90  ] 
0.00022    (NITE90)  2    -   0.00001   [  (AGE)  (TTIME)  ]   +   0.00108 
[  (CLDAI)  (ENA)  ]  -0.  00595  (CLNITE)  *. 

The  predictive  equation  accounts  for  43.25  percent  of 
the  variance  in  rate  at  a  significance  level  of  95  percent. 
This  category  provides  the  only  case  where  cross-products 
contribute  to  the  predictive  equation.  The  equation 
consists  of  only  six  basic  variables  in  some  functional 
form,  and  all  six  are  pilot  oriented  variables.  This  could 
be  indicative  of  the  inherent  safety  of  large  multi-engined 
propeller  aircraft  where  a  flight  may  be  aborted  due  to  a 
mechanical  failure  with  a  lower  probability  of  an  accident 
resulting  frcm  the  mechanical  failure. 

The  inclusion  of  pilot  oriented  variables  in  the  area  of 
carrier  landings  casts  doubt  upon  the  validity  of  the 
predictive  equation  because  many  of  the  aircraft  are  not 
carrier-based  and  their  pilots  do  not  record  carrier 
landings.  The  equation  is  still  considered  valid  by  the 
author  in  that  a  value  of  zero  was  recorded  for  those  pilots 
with  no  carrier  landings  and  if  the  regression  technique 
still  selects  that  variable  it  is  due  to  the  correlation 
interactions  with  that  variable  and  the  combined  independent 
variables   included   in   the   equation   with  the   dependent 
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variable  rate. 


TABLE  7 

PROPELLER  AIRCRAFT  VARIABLE  SUMMARY 

VARIABLE  PROPS 

NITE902  0.09764 

CLNITE2  -0.22505 

CLDDNA  0.21845 

TTNITE  0.554  15 

AGEDNA  0.19878 

TABLE  ENTRIES  ARE  THE  CORRELATION  COEFFICIENT  OF  THE 

DISPLAYED  VARIABLE  WITH  RATE 


TABLE  8 

PROPELLER  AIRCRAFT  BASIC  VARIABLE  SUMMARY 

VARIABLE  PROPELLERS  FREQUENCY 

AGE  1  1 

TTIME  2  2 

TOT90  0  0 

DAY90  0  0 

NITE90  2  2 

CLDAY  1  1 

CLNITE  1  1 

DNA  1  1 

ACTOUR  0  0 

ACHRS  0  0 
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Table  7  shows  that  only  one  variable,  CLNITE2,  is 
negatively  correlated  with  rate,  while  the  remaining 
variable  forms  are  all  positively  correlated.  Table  8 
reflects  the  degree  of  usage  of  each  basic  variable  with 
total  time  and  night  hours  being  used  twice  and  the 
remaining  basic  varaibles  one  time  or  not  at  all. 


D.   HELICOPTERS 

The  category  helicopters  consists  of  the  aggregate  of 
H-1,  H-2,  H-3,  H-46  and  H-53.  This  category,  like  that  of 
Propellers  did  not  provide  sufficient  data  to  conduct 
independent  analysis  by  type  aircraft.  The  aggregate 
yielded  a  sample  size  of  thirty-three  cases. 

The  analysis  yielded  only  two  predictive  equations  for 
the  seven  regression  packages.  The  equation  provided  by  the 
basic  variables  was  a  judged  •best1  and  is: 

Hate  (HELO)  =  0.00062  (TTIME)  +  0.65405. 

This  equation,  significant  at  the  ninety  percent  level 
accounts  for  9.30  percent  of  the  variance  in  rate.  The 
second  equation  provided  by  the  remaining  five  regressions 
accounted  for  11.45  percent  of  the  variance  in  rate.  The 
difference  cf  2. 15  percent  was  not  deemed  sufficient 
increase  to  use  the  second  equation  which  consisted  of  the 
variable  TTIME  squared. 
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V .   DISCOSS 10 N 

It  is  interesting  to  note  that  even  though  the  variable 
AGE  was  the  most  significient  single  variable  in  the  overall 
equation  arrived  at  in  the  study  by  Stucki  and  Maxwell  that 
AGE  appeared  only  once  in  the  current  study.  The  variable 
AGE  appeared  as  a  cross  product  with  TTIME  in  the  prediction 
equation  for  propeller  aircraft.  The  current  study  employs 
the  variable  DNA  which  was  not  used  by  Stucki  and  Maxwell. 
The  variable  DNA  was  used  in  some  functional  form  seven 
times  and  represents  the  highest  single  useage  of  any  basic 
variable.  The  simple  correlation  between  AGE  and  DNA  is 
quite  high  in  each  of  the  eight  categorys.  Intuitively  this 
implys  that  one  or  the  other  will  dominate  and  both  will  try 
to  account  for  the  same  portion  of  variance  in  rate 
explainable  by  this  type  of  variable.  A  similar  trend 
appears  in  looking  at  the  usage  of  TOT90,  DAZ90  and  NITE90. 
As  the  sum  of  DAI90  and  NITE90  equals  TOT90  it  would  be 
expected  that  one  of  the  variables  would  dominate  the 
predictions.  This  is  in  fact  the  case  as  NITE90  is  used  six 
times,  DAY90  three  times  and  TOT90  enters  only  in  the 
alternate  best  equation  for  Attack  aircraft.  The  variables 
TTIME  and  CLDAI  are  each  used  six  times  in  some  functional 
form  while  CLNITE  appears  four  times.  The  aircraft  oriented 
variables  appear  a  total  of  six  times  in  some  functional 
form,  four  times  for  ACTOUR  and  twice  for  ACHES. 

Although  some  of  the  Regression  Packages  yielded  the 
same  equation  within  categories,  Regression  Package  IV  can 
be  credited  with  the  best  equation  in  five  out  of  eight 
categories.  Regression  IV  provided  the  best  equation  in  the 
four  attack  aircraft  categories  and  in  the  category  Fighter 
aircraft.  Regression  Package  V  provided  the  Propeller 
equation,    Regression   II    the  Helicopter   equation   and 
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Regression  III  the  F-4  equation.  The  predictive  equation 
for  F-4  is  the  only  case  where  the  best  equation  was 
provided  by  a  regression  package  that  contained  only  pilot 
oriented  variables.  The  category  consisting  of  Helicopters 
was  the  only  case  where  the  transformed  variables  did  not 
provide  a  large  improvement  over  pure  basic  variables  in  the 
predictive  equation  generated. 

The  majority  of  the  variables  entered  in  the  eight 
predictive  equations  were  of  the  variable  squared  or  the 
square  root  of  the  variable,  thirteen  and  fourteen  times 
respectively.  Three  cross  products  used  six  variables  and 
the  basic  variables  appeared  six  times. 

There  does  not  appear  to  be  any  trend  or  tendency  for 
any  particular  basic  variable  to  be  consistent  over  the 
range  of  the  eight  categories  considered.  If  the  hypothesis 
that  the  older  more  experienced  pilot  is  a  safer  pilot  is 
valid  and  if  the  variables  of  AGE  and  DNA  can  be  considered 
as  measures  of  this  hypothesis,  then  the  author  would  expect 
the  simple  correlation  coefficients  of  these  variables  to  be 
negative.  This  is  not  the  case  as  five  of  the  sixteen 
coefficients  are  positive.  It  is  possible  that  due  to  the 
relatively  small  size  of  each  sample  and  the  fact  that  there 
are  months  where  only  one  accident  occured  that  a 
coefficient  could  be  only  slightly  positive  wirhout 
violating  the  hypothesis.  This  does  not  explain  the 
coefficients  of  the  A-7  category  (see  Table  9)  where  the 
coefficient  of  AGE  is  on  the  order  of  0.26  and  DNA  is  0.15. 
As  stated  previously  the  author  is  unable  to  explain  this 
phenomena.  Similar  arguments  can  be  generated  for  each  of 
the  ten  basic  variables.  The  closest  case  to  being 
consistent  in  sign  is  with  CLNITE  where  all  coefficients  are 
negative  except  that  of  A-6  which  is  0.07.  The  value  of 
0.07  for  A-6  combined  with  the  extremely  small  sample 
considered  in  this  case  (20  data  points)  leads  the  author  to 
discount  any  significance  in  sign  for  this  variable. 
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TABLE  9 


AGE 

TTIME 

TOT90 

DAY90 

NITE90 

CLDAY 

CLNITE 

ACTODR 

ACHRS 

DNA 

AGE 

TTIME 

TOT90 

DAY90 

NITE90 

CLDAY 

CLNITE 

ACTOUR 

ACHRS 

DNA 


ATTACK 

-0.27305 

-0.05110 

0.02845 

0.01701 

0.07302 

-0.19153 

-0.11847 

-0.33037 

0.31152 

-0.18404 

FIGHTER 

-0.006448 

0.15639 

0.11077 

0.23847 

-0.22743 

-0.16666 

-0.16156 

0.42709 

0.03460 

-0.02518 


A-4 

-0.12430 

-0.08715 

-0.01864 

-0.08675 

0.12366 

0.05906 

-0.01026 

rO. 11151 

0.29628 

-0.25457 

F-4 

0.00785 

0.01361 

0.  14855 

0.20732 

-0. 14624 

0. 10926 

-0.12479 

0.19403 

0.20765 

-0.05232 


A-6 
-0.30084 
-0.09278 
-0.02922 

0.02350 
-0.11615 
-0.02135 

0.07686 
-0.17445 

0.10899 
-0.28943 

PROPS 
-0.06703 

0.21211 

0.08224 
-0.00069 

0.16823 

0.20479 
-0.22141 

0.14963 

0.12285 
-0.05969 


A-7 

0.26525 

0.  17624 

-0.  03100 

-0.07358 

0.073  86 

0.21725 

-0.03701 

-0. 16477 

-0.  11530 

0.  15431 

HELOS 

0.09091 

0.30509 

0.09746 

0.  12454 

-0.00036 

** 

** 

-0.07866 

0.08244 

0.04829 


**  no  carrier  landings  recorded 
TABLED  VALUES  ARE  THE  CORRELATION  COEFFICIENTS  OF  THE  BASIC 
VARIABLES  WITH  ACCIDENT  RATE  BY  CATEGORY 

The  net  effect  on  Rate  by  the  combined  functional  forms 
of  a  basic  variable  was  discussed  in  the  results  by 
category.  It  should  be  noted,  however,  that  adjusting  the 
values  of  a  basic  variable  to  bring  about  a  decrease  in  the 
accident  rate  could  potentially  cause  one  of  the  other 
measures  to  shift  in  such  a  way  that  the  total  net  effect 
would  be  to  increase  the  accident  rate. 
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VI.   RECOMMENDATIONS 

■■■■'■  ■  ■■  .  ■  i  -  — — . . .-  — T 

The  current  study  consisted  of  analysis  of  aircraft 
accident  rate  by  type  aircraft.  In  this  approach  the  size 
of  the  communities  and  the  resulting  size  of  the  data  base 
are  such  that  the  analysis  is  constrained  by  the  degrees  of 
freedom  available  in  regression  techniques.  As  a  future 
study  the  techniques  and  hypotheses  employed  in  this  study 
would  be  a  useful  starting  point.  The  necessity  of  a  larger 
data  base  would  be  overcome  by  time  as  the  inventory  studied 
is  not  that  different  from  the  current  Navy  inventory.  The 
sensitivity  of  some  of  the  basic  variables  employed  in  this 
study  suggest  that  future  studies  procure  additional  data  of 
the  following  types: 

1)  In  addition  to  the  number  of  day  and  night  carrier 
landings  in  the  past  thirty  days,  a  numerical  grade  of  the 
quality  of  each  landing  made  should  be  included. 

2)  A  breakdown  of  the  hours  flown  in  the  preceeding 
ninety  days  to  include,  for  example,  flight  hours  in  past  24 
hours,  flight  hours  in  past  72  hours.  This  would  allow 
inclusion  of  concepts  of  fatigue  versus  proficiency. 

3)  In  addition  to  AGE  and  DNA,  the  number  of  months  in 
operational  flying  billets  and  the  number  of  months  in 
current  tour. 

The  data  base  employed  contained  much  information  on 
accidents  and  allows  constructing .a  profile  of  the  pilot  who 
had  an  accident.  The  single  most  severe  hinderance  to  this 
author  in  drawing  conclusions  was  the  lack  of  adequate  or 
equal  knowledge  of  the  pilot  who  did  not  have  an  accident. 
It  is  recommended  that  prior  to  any  future  studies  of  this 
type  the  analyst  procure  data  on  accident  free  pilots  with 
as  many  variables  in  common  with  the  accident  involved  pilot 
as  feasible.   The  hinderance   to   this   author   was   that   a 
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profile  of  the  accident  involved  pilot  may  be  identical  to  a 
profile  of  the  non-accident  involved  pilot.  Without  the 
results  of  this  comparison  the  usefulness  of  the  predictive 
equations  for  prediction  cannot  be  demonstrated 
statistically.  It  is  possible  however  that  the  equations 
could  be  validated  by  using  them  to  attempt  to  predict  and 
comparing  the  actual  resulting  rates. 

The  real  benefits  of  this  study  are  in  the  analysis  of 
the  variables  that  enter  the  equations  and  by  using  the 
frequency  of  appearance  in  planning  future  studies  with  even 
greater  detail  in  those  areas  where  the  variables  appear  to 
contribute  the  most. 

While  this  study  is  somewhat  broad  in  scope  it  does 
provide  encouragement  for  future  efforts  along  this  line  of 
reasoning.  The  ever  increasing  necessity  to  reduce  loss  in 
human  life  and  dollars  due  to  aircraft  accidents  provides 
the  incentive. 
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APPENDIX  A 


AVERAGE  MONTHLY  DATA  POINT  VALUES 


This  appendix  contains  the  average  monthly  data  point 
values  for  the  basic  variables  and  the  dependent  variable, 
Rate.  Each  aircraft  type  or  community  examined  in  the  study 
is  recorded  in  a  table. 

The  dependent  variable  Rate  is  the  aircraft  accident 
rate  per  ten  thousand  hours.  Rate  is  calculated  by  taking 
the  number  of  aircraft  accidents  for  the  month  times  ten 
thousand  hours  and  dividing  by  the  total  number  of  hours 
flown  by  that  type  aircraft  for  the  month. 
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